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1.  Introduction 

The  key  words  in  the  title  of  this  paper  are  "data  analysis."  for  in  the  categorical  data  area, 
as  in  almost  all  of  his  research  work.  William  G.  Cochran  was  concerned  with  the  practical 
aspects  of  statistical  methods  and  theory.  Although  his  early  papers  on  the  topic  (e.g.  Cochran 
1936a.  1940)  included  formal  mathematical  statistical  derivations,  their  focus  was  always  on 
actual  applications  and  on  ways  of  adjusting  theoretical  results  to  deal  with  the  data  at  hand. 
This  early  practical  orientation  extends  throughout  Coeh'an's  research  on  the  statistical  analysis 
of  categorical  data,  research  that  spanned  five  decades. 

This  review  of  Cochran's  contributions  to  categorical  data  analysis  focuses  on  what  1  take  to 
be  his  four  major  contributions: 

(a)  the  distribution  of  A:  test  statistics  in  the  presence  of  small  expectations,  and  the 
correction  >r  continuity. 

(b)  the  Q-test  for  comparing  percentages  in  matched  samples. 

(c)  methods  for  strengthening  A -  -tests. 

(d)  the  Cochran  test  for  combining  results  from  several  2x2  tables. 

Each  of  these  contributions,  by  itself,  would  have  been  sufficient  to  establish  any  ordinary 
statistician's  claim  as  a  statistical  innovator.  Taken  together  they  represent  pathbreaking  work 
by  an  extraordinary  statistician,  who  was.  at  the  same  lime,  making  major  contributions  to 
other  areas  of  statistics,  such  as  the  design  of  experiments  and  sample  surveys. 

Although  the  contributions  listed  above  were  all  published  in  the  2940's  and  1950's.  Cochran 
continued  to  monitor  the  progress  of  research  on  categorical  data  analysis,  especially  during  the 
renaissance  of  the  mid-1960's.  out  of  which  came  new  methods  for  the  analysis  of  loglincar 
and  logit  models.  I  suspect  this  continuing  interest  stemmed  from  his  understanding  the 
practical  importance  of  this  work  for  other  problems  in  which  he  was  interested.  I  remember 
Bill  approaching  me  in  1967  to  discuss  his  reply  to  a  letter  from  Berkson  that  dealt  with 
simultaneous  logit  equations  for  a  trichotomous  response.  In  later  years,  we  had  several 


conversations  about  the  work  1  was  doing  with  Fred  Mostcller.  Yvonne  Bishop,  and  Paul 

Holland  on  loglinear  models,  and  in  Chapter  20  of  the  7th  edition  of  Snedecor  and  Cochran 
(1980).  Bill  added  several  paragraphs  on  the  loglinear  model,  and  how  it  provides  a 

generalization  for  the  logit  methods  described  there  (and  in  the  earlier  6th  edition). 

2.  The  Distribution  of  A2  Test  Statistics  in  Presence  of  Small  Expected  Values 

In  his  first  paper  on  the  analysis  of  categorical  data  (1936a).  written  while  he  was  a  member 
of  the  staff  of  the  Rothamsted  Experiment  Station,  Cochran  presents  a  detailed  analysis  of  the 
results  of  an  experiment  focussing  on  the  distribution  of  diseased  plants  in  a  4x4  Latin  Square 
layout  of  plots,  each  with  6  rows  containing  15  tomato  plants.  After  beginning  with  an  actual 
field  map  of  the  diseased  plants,  he  presents  the  variance  test  for  the  homogeneity  of  N 
binomial  proportions,  showing  (using  an  argument  suggested  by  Fisher)  that  it  is  identical  to 
the  usual  A;  test  for  homogeneity  in  a  2xN  table.  In  applying  this  test  to  the  tomato  plant 
data.  Cochran  expresses  concern  about  the  effeci  of  small  expectations  on  the  A:  approximation 
for  the  distribution  of  the  test  statistic,  and  he  illustrates  how  one  can  study  the  effect  by 
looking  at  the  exact  distribution  of  A:  (something  that  is  not  explained  in  detail  until  the 
second  1936  paper).  He  then  illustrates  the  results  of  N  =  10  groups  of  n  =  9  plants  each 
where  the  overall  disease  rate  is  p  =  0.2.  The  expectations  are  1.8  and  7.2  (the  former  was 
considered  quite  small  back  in  1936).  yet  the  highest  percentage  discrepancy  between  the  exact 
probability  and  the  A:  approximation  is  about  16%  at  around  p  =  .2.  The  A-'  approximation  is 
surprisingly  good,  especially  at  p  =  .05  and  p  =  .01. 

The  remainder  of  this  first  1936  paper  focusses  on  supplementary  analyses,  such  as  an 

analysis  of  variance  on  the  number  of  diseased  plants  to  take  into  account  the  row  and  column 
structure  (this  is  a  section  of  the  paper  that  might  have  benefited  from  the  modern  methods 
for  logit  models),  and  the  issue  of  contagion  of  disease  using  a  test  for  runs.  All  in  all. 
Cochran  presents  an  illuminating  example  of  data  analysis,  and.  in  the  process  of  doing  so.  he 
introduces  several  technical  proofs  of  -results,  as  well  as  a  first  attempt  at  dealing  with  the 


problem  of  small  expectations. 


In  what  appears  to  be  a  companion  piece  to  the  first  paper.  Cochran  (1936b)  focusses 

directly  on  the  effect  of  small  expectations  on  the  binomial  variance  test  (and  thus  the  x:  test 

for  2xN  tables).  Here  we  learn  that  by  "the  exact  distribution  of  X:,"  Cochran  means  the 
distribution  of  A:  based  on  the  conditional  distribution  given  both  the  row  and  column  totals 

in  the  2xN  table.  That  is.  suppose  x  .x„ . x  are  the  number  of  successes  out  of  n  for  N 

binomials,  each  with  probability  of  success  p.  Then  Cochrar.  looks  at  the  conditional 

distribution  of  x^x, . x^  given  the  total  T  =  x  +  x,  +  ...  +  x  .  i.e.  at  the  probability 

(n!)KT!(Nn-T)l 

P’  =  -  . 

(Nn)in  [x  !(n-x  )!] 

1  l  i 

(2.1) 

Because  the  x 's  are  typically  not  all  distinct  the  probability  in  (2.1)  corresponds  to  N1/IT  a! 
possible  arrangements  of  the  N  x  s.  where  there  are  a  0's.  a,  I  s.  etc.  Thus  the  exact 
probability  is 

N! 

P  =  =—  P’ 
n  a ! 

,i  ,i 

(2.2) 

The  exact  distribution  of  X:  then  comes  from  an  enumeration  of  all  possible  sample 
configurations,  and  the  calculation  of  the  probability  in  (2.2)  and  the  corresponding  value  of 


(x  -T/N); 

X-  =  - ! - 

T(nN-T)/N:n 

(2.3) 


Cochran  never  presents  any  discussion  of  or  formal  justification  for  why  the  relevant 
probability  of  interest  should  be  based  on  the  conditional  distribution  given  the  sufficient 
statistic  T.  although  presumedly  he  was  strongly  influenced  by  Fisher  and  Yates  on  this  issue. 
(For  a  modern-  day  justification  of  the  use  of  the  conditional  distribution,  with  which  1  am 
not  in  total  sympathy,  see  Cox.  1970.  pp.  44-46.). 
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Cochran  explains  this  second  1936  paper  by  taking  us  through  a  detailed  example  for  n  =  8. 
p  =  .25.  and  N  =  4.  Although  he  does  not  explain  it  in  the  text,  he  appears  to  have  taken  T 
=  Nnp  =  8  (somewhat  arbitrarily).  A  summary  table  includes  13  of  the  15  possible  values  of 
the  statistic  (the  two  highest  values  being  omitted),  the  corresponding  exact  tail  probabilities 
(P  as  in  (2.2)).  the  tabular  values  from  the  X:  distribution,  and  an  adjusted  tabular  X2  value 
corrected  for  continuity  (more  on  this  later).  In  this  and  four  other  examples,  Cochran  finds 
tolerable  percentage  discrepancies  in  the  range  0.1  ^  P  ^  0.005  resulting  from  the  use  of  the 
tabular  values,  even  when  np  is  as  low  as  0.9.  and  a  tendency  for  the  adjusted  tabular  X: 
values  to  overestimate  the  exact  tail  values.  P. 

For  the  special  case  of  np  and  n  small  but  N  large.  Cochran  investigates  the  normal 
approximation,  using  expansions  for  the  mean  and  variance,  and  he  finds  a  result  somewhat 

different  from  the  usual  normal  approximation  to  the  X:  distribution.  Here  and  earlier  he 
investigates  not  only  the  binomial  problem  but  also  the  Poisson  limit  in  which  n  -»  oc.  p  -»  0. 
and  np  -»  m. 

The  final  section  of  the  second  1936  paper  recalls  Fisher's  justification  of  the  validity  of  the 
X:  statistic,  i.e.  that  it  appi cximates  minus  twice  the  loglikelihood  ratio.  For  the  Poisson  limit, 
this  latter  quantity  is 

G:  =  2lxlog^  . 

'  1  m 

(2.4) 

We  note  that  expression  (2.4)  can  be  of  use  as  a  generic  form  for  the  loglikelihood  ratio 

statistic  that  parallels  the  generic  Pearson  formula  X:  =  Z(x-m):/m.  Cochran  then 

returns  to  his  example  with  m  =  2  and  N  =  4  and  shows  that  the  tabular  X:  values,  both 

corrected  and  uncorrected,  underestimate  the  tail  probabilities  calculated  from  the  exact 
distribution  for  G:.  although  the  agreement  is  "not  sensibly  worse"  than  that  for  the  Pearson 
statistic.  X-\ 


In  this  pair  of  1936  papers  we  see  Cochran's  attempt  to  grapple  with  the  difficult  practical 
problem  presented  by  small  expectations  using  a  mixture  of  mathematical  statistics  theory  and 
tabular  calculations.  These  papers  leave  many  practical  and  theoretical  questions  unanswered, 
and  Cochran  returned  to  some  of  these  in  his  subsequent  research. 

In  a  1942  paper,  for  example,  he  focussed  his  attention  on  the  correction  for  continuity  for 
A:  tests,  giving  a  careful  statement  of  how  the  A;  distribution  approximation  comes  about  if 
one  regards  "the  exact  discontinuous  distribution  as  a  grouping  of  the  tabular  distribution,  with 
each  possible  value  of  the  exact  A:  representing  all  values  of  the  continuous  A:  which  are 
nearer  to  it  than  to  any  other  permissible  value  of  A:."  Using  this  formulation.  Cochran  is 

able  to  state  a  general  rule  in  applying  corrections  for  continuity: 

Calculate  A;  by  the  usual  formula.  Find  the  next  lowest  possible  value  of  A-'  to  the 
one  to  be  tested,  and  use  the  tabular  probability  for  a  value  of  A:  midway  between 
the  two. 

If  the  possible  values  of  A:  are  closely  spaced  together,  the  probabilities  given  by 
the  uncorrected  A:  and  the  corrected  A:  may  differ  only  by  an  amount  that  is 
regarded  as  negligible.  In  this  case  the  correction  may  be  ignored. 

He  notes  that  this  general  rule,  when  applied  to  A-'  for  the  2x2  table,  results  in  the  usual 

Yates  correction. 

Cochran  then  turns  to  the  special  problem  which  occurs  when  a  A;  statistic  can  be 
partitioned  into  single-degree-of-freedom  components.  He  argues  both  on  theoretical  grounds 
and  numerically  that,  if  the  total  A:  is  to  be  computed  as  the  sum  of  its  components,  it  is 
appropriate  to  add  uncorrected  A-’  values  and  then  correct  the  total  using  the  general  rule,  if 
necessary.  Adding  corrected  values  badly  overcorrects,  and  the  agreement  with  the  exact 
distribution  gets  steadily  worse  as  the  number  of  A:  components  grows. 

At  the  end  of  this  1942  paper.  Cochran  returns  to  the  issue  of  small  expectations,  and 
develops  a  new  approach  to  exploring  the  effect  of  one  small  expected  value.  He  begins  with 
a  trinomial  problem  and  takes  three  examples  with  n  =  20.  where  m|/mi  -  3/2  but  m  varies 
from  1.0  to  0.1.  If  we  treat  the  m's  as  known,  then 
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1  :  m 

1 

(2.5) 

has  an  asymptotic  X2  distribution  with  2  degrees  of  freedom.  Cochran  compares  the  exact 
distribution  with  the  tabular  .dlues,  once  again  finding  fairly  good  agreement  down  to  about 
p  =  .03  for  both  m.  =  1  and  m,  =  .5.  But  for  m.  =  .1  he  notes  that  “the  tabular  is 
useless  as  an  approximation  throughout  the  whole  of  the  region  between  p  =  .1  and  p  =  .01." 
Because  of  the  serious  underestimation  of  the  tail  area  probabilities  for  m.  =  .1.  and  beyond 
p  =  .03  for  m.  =  .5  and  m.  =  1.  Cochran  suggests  that  a  correction  for  continuity  cannot 
repair  the  discrepancy,  and  thus  discontinuity  is  no;  its  principal  cause.  This  leads  him  to 
approximate  the  exact  tail  area  for  the  /-cell  multinomial  problem  in  the  presence  of  a  small 
expectation  in  the  rth  cell  by 


(k-m.): 


(2.6) 

where  k  is  Poisson  with  mean  mA  .  This  approximation  fits  extremely  well  for  the  trinomial 
exact  distribution  with  m,  =  .1.  Finally  Cochran  uses  this  approximation  to  come  up  with  the 
following  recommendations  for  the  minimum  degrees  of  freedom,  r  -  1.  to  ensure  that  the 
usual  approximation  is  accurate  to  within  20%  at  the  .05  and  .01  level,  for  various  values  of 
m : 

r 

Smallest  expectation  0.1  0.5  1.0  2.0  3.0  5.0 

Minimum  d.f.  ?  25  10  6  4  2 

The  paper  concludes  with  a  brief  look  at  the  problem  of  two  small  expectations. 

A  decade  later.  Cochran  (1952.  1954)  was  still  refining  his  advice  on  the  minimum 
expectations  to  be  used  in  A:  tests.  The  following  recommendations  from  Cochran  (1954)  are 
reasonably  consistent  with  his  earlier  research  described  above,  but  were  based  on  some 
additional  but  unpublished  calculations: 

la)  Goodness  of  fit  tests  of  unimoda /  distributions  isuch  as  the  normal  or  Poisson). 
Here  the  expectations  will  be  small  only  at  one  or  both  tails.  Group  so  that  the 


minimum  expectation  at  each  tail  is  at  least  1. 

ib)  The  2X2  table  Use  Fisher's  exact  test  (1)  if  the  total  N  of  the  table  <  20.  (ti)  if  20 
<  N  <  40  and  the  smallest  expectation  is  less  than  5.  ...  If  N  >  40  use  * corrected 
for  continuity. 

ic)  Contingency  tables  with  more  than  1  d.t.  If  relatively  few  expectations  arc  less  than 
5  (say  in  1  cell  out  of  5  or  more,  or  2  cells  out  of  10  or  more),  a  minimum 
expectation  of  1  is  allowable  in  computing  A:. 

Contingency  tables  with  most  or  all  expectations  below  5  are  harder  to  prescribe  for. 
With  very  small  expectations,  the  exact  distribution  of  A:  can  be  calculated  without  too 
much  labor.  .  .  .  If  ha:  less  than  30  degrees  of  freedom  and  the  minimum 

expectation  is  2  or  more,  use  of  the  ordinary  tables  is  usually  adequate.  If  has 
more  than  30  degrees  of  freedom,  it  tends  to  become  normally  distributed,  but  when 
the  expectations  are  low.  the  mean  and  variance  are  different  from  those  of  the 
tabular  A:.  .  .  .  Compute  the  exact  mean  and  variance,  and  treat  as  normally 

distributed  with  that  mean  and  variance. 


A  hidden  issue  in  all  of  Cochran's  work  on  tests  in  the  presence  of  small  expectations  is 
the  relevant  reference  distribution  for  the  test  statistic.  Cochran  consistently  uses  the  "exact 
distribution"  of  X:.  conditional  on  the  values  of  the  margins,  as  his  reference  distribution. 
This  was  consistent  with  the  work  of  Fisher  and  Yates,  and  allowed  Cochran  actually  to 
compute  distributions  in  the  pre-  computer  era  because  he  needed  to  worry  about  only  a 
relatively  small  number  of  possible  values  for  the  test  statistic.  An  alternative  to  Cochran's 
approach  is  to  use  the  large-sample  distribution  rather  than  the  exact  distribution  as  the 
reference  distribution.  If  one  of  our  aims  is  to  correct  the  statistic  to  conform  more 
closely  to  the  actual  distribution,  the  usual  correction  for  continuity,  suggested  for  use  by 
Cochran,  results  in  an  overly  conservative  test  (sec  e.g.  Plackett.  1%4:  Grizzle.  196":  and 
Conover.  1974).  Moreover,  the  discrepancy  in  behavior  between  the  X:  and  G:  statistics  from 
this  large-sample  perspective  can  be  attributed  to  the  differing  influence  of  cells  with  very 
small  observed  as  opposed  to  expected  counts  (see  Larntz.  1978:  Fienberg.  1979). 

3.  The  Q-Test  for  Matched  Proportions 

Cochran's  (1950)  paper  on  the  comparison  of  percentages  in  matched  samples  is  a  gem.  He 
begins  with  the  case  of  two  matched  proportions  and  presents  the  well-known  McNemar- 
Mostcller  test.  He  then  generalizes  this  test  statistic  to  the  case  where  there  arc  c  >  2 
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matched  samples.  The  setup  is  as  follows. 

Consider  an  n  x  c  table  listing  the  results  on  c  binary  items  (the  c  matched  samples)  for  n 
individuals.  Lei  x  be  1  if  there  is  a  success  for  individual  i  on  item  j.  and  assume  that  the 

II 

total  number  of  successes  in  each  row  as  fixed  (i.e.: 

u  =  I  x 

■  I- 1  i. 

(3.1) 

is  fixed  for  i  =  1.2 . n).  Cochran  proposes  a  test  statistic  based  on  randomization  of  zeros 

and  ones  within  rows.  Since  rows  containing  only  0's  or  only  l's  (i.e.  rows  for  which  u  =0  or 
u  =c)  aren't  changed  under  such  a  randomization,  they  play  no  part  in  Cochran's  test  statistic, 
and  thus  thev  can  be  eliminated  from  consideration.  The  arra\  f  x  }  is  thus  reduced  to  size 

*  *  Ij 

r  x  c  .  where  the  row  total  u  can  take  values  1.  2 .  c-1. 

Cochran's  Q-test  for  tne  equality  of  the  proportions  in  the  matched  samples  is  then  based 

on  the  sum  of  squared  deviations  of  the  c  column  toLals.  T.  of  the  r  x  c  table,  suitabh 

\ 

normalized  so  that  the  resulting  quantity  follows  an  asymptotic  distribution: 

c(c-l)I(T  -T); 

Q  =  — - i — -  . 

c(Xu  )  -  (Su  *) 

l  i 

(3.2) 

Under  the  randomization  distribution  with  fixed  row  totals,  the  asymptotic  distribution  of  Q  is 
with  c-1  d.f..  and  Cochran  presents  a  heuristic  binomial-like  argument  leading  to  this 
result.  For  the  case  c=2.  Q  reduces  to  the  McNemar  lest  statistic.  Cochran's  key  step  in  the 
c  >  2  case  is  the  use  of  the  randomization  distribution,  but  this  procedure  actual!}  specifies  a 
null  hypothesis  that  is  far  more  restrictive  than  the  null  hypothesis  of  interest,  i.e.  equality  of 
proportions  for  the  c  items. 

An  alternative  way  to  view  the  matched  proportions  problem  is  to  treat  each  row  of  the 

(x  }  arrav  as  an  independent  observation  from  a  2"  contingencv  table.  The  hvpothesis  of 

1.1 

equality  of  matched  proportions  can  then  be  translated  into  the  hypothesis- of  homogeneity  of 


the  one-way  marginal  proportions  in  the  2'  table.  The  randomization  distribution  used  to 
derive  the  Q  statistic  for  the  {x  }  array  implies  complete  symmetry  (under  permutation  of 
dimensions)  for  the  2'  table.  As  Bishop.  Fienbcrg.  and  Holland  (1975.  Chapter  8)  note. 
Cochran’s  Q-test  is  then  equivalent  to  a  likelihood  ratio  test  for  the  model  of  complete 
symmetry  (or  one-way  marginal  homogeneity)  given  the  model  of  one-way  quasi-symmetry. 
Although  Cochran  was  quite  explicit  about  the  complete  symmetry  implicit  in  the  randomization 
justification  of  the  Q  test,  many  of  those  who  subsequently  investigated  the  properties  of  the 
test  failed  to  understand  its  implications. 


There  is  another  interesting  aspect  of  the  Q-tesi  which  links  it  to  the  modern  literature  on 
loglincar  model:  for  contingency  table  analysis.  Placketi  (1981).  in  a  very  brief  section  on  the 
2nd  edition  of  his  monograph  on  the  analysis  of  categorical  data,  notes  thai  Cochran's  Q- 
staiisuc  can  also  be  viewed  as  a  means  for  testing  ihe  equality  of  item  parameters  in  the  Rasch 
(1980)  model.  That  is.  if 


P(x  =1) 
P(x  =0) 


then  Q  can  be  used  to  test 


H:  »  =0 


for  all  i. 


This  observation,  although  it  seems  simple  on  the  surface,  is  directly  related  to  new  and 
exciting  ideas  for  the  analysis  of  the  Rasch  model  using  loglinear  models  applied  to  the 
corresponding  2"  contingency  table,  that  have  recently  been  proposed  by  Duncan  (1982).  and  by 
Tjur  (1981). 


In  the  1950  paper.  Cochran  also  investigates,  for  8  specific  cases,  the  small  sample  behavior 
of  Q  by  comparing  the  approximations  (with  and  without  a  correction  for  continuity  based 
on  the  general  rule  he  had  proposed  in  1942  with  the  exact  distribution  based  on 
randomization.  He  also  introduces  an  F  approximation  which  fares  poorly.  As  a  result  of 


these  comparisons.  Cochran  concludes  that  the  use  of  the  uncorrected  ■*  approximation  is 
superior,  and  quite  adequate  for  most  situations  jo  which  the  Q  statistic  should  be  applied. 


In  retrospect,  what  makes  the  1950  paper  on  matched  proportions  so  remarkable  is  how 
complete  and  how  innovative  it  was.  on  the  one  hand,  and  yet  how  many  problems  it  opened 
up  for  other  statisticians  to  investigate,  subsequently,  on  the  other.  This  combination  was  a 
characteristic  of  Cochran's  writings:  he  was  always  thorough,  but  he  also  pointed  out  aspects  ol 
problems  worthy  of  further  investigation,  and  encouraged  others  to  pursue  them. 

4.  Methods  for  Strengthening  A2 


In  another  remarkable  paper.  Cochran  (1954)  provided  an  incisive  review  of  methods  for 
strengthening  or  supplementing  the  common  uses  of  the  A-  test,  and  at  the  same  time  he 
presented  two  new  and  important  classes  of  tests.  The  first  of  these,  discussed  in  this  section, 
is  a  class  of  1  d.f.  tests  for  linear  functions  of  the  deviations  between  observed  and  expected 
counts.  (A  full  justification  of  these  tests,  as  well  as  a  generalization  to  multiple  degree-of- 
freedom  tests,  was  subsequently  given  in  Cochran  (1955).)  The  second  new  class  of  tests  was 
for  combining  results  from  several  2x2  tables.  Because  of  the  importance  ot  the  latter,  we 
treat  it  as  a  separate  topic  in  Section  5. 


Cochran  begins  the  1954  paper  by  noting  that: 

In  ehis  paper  1  want  to  discuss  two  kinds  of  failure  to  make  the  best  use  of 
tests  which  I  have  observed  from  time  to  time  in  reading  reports  of  biological 
research.  The  first  arises  because  A:  tests,  as  has  often  been  pointed  out.  are  noi 
directed  against  any  specific  alternative  to  the  null  hypothesis.  ...  No  attempt  is 
made  to  detect  any  particular  pattern  of  deviations  (f  -  m  )  that  may  hold  if  the 
null  hypothesis  is  false.  One  consequence  is  that  the  usual  A;  tests  are  often 
insensitive,  and  do  not  indicate  significant  results  w'hen  the  null  hypothesis  is  actually 
false.  Some  forethought  about  the  kind  of  alternative  hypothesis  that  is  likely  to 
hold  may  lead  to  alternative  tests  that  are  more  powerful  and  appropriate.  Further, 
when  the  ordinary  A;  test  does  give  a  significant  result,  it  does  not  direct  attention 
to  the  way  in  which  the  null  hypothesis  disagrees  with  the  data,  although  the  pattern 
of  deviations  may  be  informative  and  suggestive  for  future  research.  The  remedy 
here  is  to  supplement  the  ordinary  test  by  additional  tests  that  help  to  reveal  the 
significant  type  of  deviation. 


The  remedy  for  both  kinds  of  failure  is  often  the  use  of  a  single  degree  of  freedom,  or  a 
group  of  degrees  of  freedom,  from  the  tola!  and  Cochran  explores  such  remedies  for  (a) 
the  goodness-of-fit  test  for  the  Poisson  distribution,  the  binomial  distribution,  and  the  normal 
distribution,  and  (b)  two-way  contingency  tables. 


For  the  distributional  goodness-of-fit  tests.  Cochran  presents  new  results  for  linear  functions 


L  =  Ig  (f  -  m) 


(4.1) 


where  f  is  the  observed  frequency,  m  the  estimated  expected  frequency  in  the  /th  cell,  and 

\  ,  l 

the  g  are  weights  selected  in  advance  in  a  way  to  make  L  sensitive  to  some  likely  alternative 
hypothesis.  Then 


X-  = 


L- 

."arT 


(4.2) 


is  asymptotically  distributed  as  with  1  d.f.  when  the  distribution  being  fitted  is  appropriate. 
In  applying  this  test  in  the  specific  cases  of  the  Poisson,  binomial,  and  normal.  Cochran 
presents,  in  particular,  the  test  statistic  for  a  single  deviation.  He  is  careful  to  discuss  the 
multiplicity  problem  that  results  from  applying  such  tests  to  abnormally  large  deviations,  after 
examining  the  data,  and  he  suggests  a  Bonferroni  approach  to  the  calculation  of  significance 
levels  in  such  instances. 


Cochran  then  turns  to  the  contingency  table  problem,  focussing  on  the  subdivision  of  degrees 
of  freedom  in  the  2xN  case  with  fixed  column  totals,  i.e.  N  binomials  with  probabilities  of 

success,  p .  i  =  1.2 . N.  and  totals  n  .  i  =  1.2 . N.  He  first  looks  at  a  division  of  the  N 

columns  into  N  and  N,  =  N  -  N(.  and  subdivides  into  3  components:  (i)  comparing  the 
overall  proportion  of  success  in  the  first  columns  with  that  in  the  last  (1  d.f.).  (ii) 
comparing  variation  of  proportions  within  the  first  N  columns  (N | — 1  d.f.).  (iii)  comparing 
variation  of  proportions  within  the  last  N,  columns  (N-N^l  d.f.).  The  subdivision  he  suggests 
gives  an  additive  partition  of  A:,  but  Cochran  notes  that  when  the  1  d.f.  component,  (i).  is 


i: 

significant,  this  partition  needs  to  be  supplemented  by  direct  ^ ■— tests  for  (11)  and  (iii).  thercbi 
yielding  a  non-additive  partition. 

Other  1  d.f.  components  in  the  2xN  table  which  Cochran  examines  include  (a)  a  1  d.f.  test 
for  linear  regression  of  the  proportions  on  some  auxiliary  variable,  z:  (b)  a  1  d.f.  comparison 
of  mean  scores  for  the  auxiliary  variable  (which  turns  out  to  be  identical  to  (a)):  and  (c)  a 
sequence  of  N-l  cumulative  1  d.f.  comparisons  of  the  column  proportions.  For  the  latter 
tests.  Cochran  suggests  partitioning  the  sum  of  squares  in  the  numerator  of  A-'  into 

njn.p,  - 
n  (n  *  n  ) 

i  i  ; 

(4.3) 


njn  "p  *  nj?  -  (n,  - 
(n  *  n_)(n  -  n,  *  n.l 

i  -  - 


and  for  the  general  term. 

n  {n^  *  *  n^p  -  (n,  -  ...  *  nip1 

(n  n  Mn  *  ...  -  n  ) 

1  i  i  H 


(4.4) 


(4.5) 


Note  that  (4.3)  is  based  on  the  2x2  table: 


X 

1 

x^ 

n  -x 

n,-x. 

n  n. 


and  that  each  subsequent  component  cumulates  the  entries  in  the  columns  already  examined  and 
compares  them  with  those  in  the  next  column.  Thus  the  general  term  in  (4.5)  is  based  on 


Ir 

X 

X 

i 

T*  1 

I' 

(n  -  x  ) 

n  -  x 

I- 

l  l 

r*  1  r*  l 

I- 

n 

n 

-  i  i  r*  1 


(4.7) 
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Cochran  s  additive  partition  into  ( N — 1 )  1  d.f.  A-'  components  involves  dividing  each  of  (4.3). 
(4.4).  (4.5).  etc.  b>  p(!-p).  where 


P  = 


(4.8) 


Cochran  also  mentions  Lancaster's  non-additive  partition  of  A:.  which  uses  as  the 
denominator  for  the  ah  component.  p''!  ( l  — p ' ' 1  ).  where 


P 


■r-  1  • 


X 


n 

i 


/  (4.9) 

This  approach  >iclds  the  usual  1  d.f.  A:  tests  for  each  of  the  2x2  tables  of  the  form  (4.7).  It 
is  interesting  to  observe  that  Cochran  failed  to  note  that,  if  likelihood  ratio  tests  are 
substituted  for  4-  tests  in  each  of  the  2x2  tables,  the  Lancaster  partition  is.  in  fact,  additive 
(Kullback.  19S9).  This  likelihood  ratio  partitioning  of  a  2xN  table  leads  rather  naturally  to  the 
following  alternative  1  d.f.  test  for  an  increasing  (or  decreasing)  shift  in  the  p's.  suggested 
recenlli  by  Stewart  (1981). 


The  N-l  2\2  tables  of  the  form  (4.7)  can  be  put  together  to  form  a  2x2x(N-l)  three-way 
contingency  table.  The  likelihood  ratio  test  for  independence  in  the  original  2xN  table  is 
identical  to  the  test  for  conditional  independence  with  the  N-l  2x2  tables  (this  is  the 
consequence  of  the  Kullback  partitioning  result).  Cal!  the  test  statistic  G*.  If  we  let  the  no- 
2nd-order  interaction  likelihood  ratio  test  statistic  be  G;.  then 


±G:  =  G'  -  G; 

is  a  1  d.f.  component  of  G‘  which  is  sensitive  to  shift  alternatives  of  the  form: 

p  =  — ^ _  . 

(1-P,>C  +  P, 


(4.10) 


(4.11) 
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Clinical  Change 

Marked 

Moderate 

Slight 

Same 

Worse 

Degree  of 

0-7  11 

2" 

42 

53 

11 

Infiltration 

8-15 

15 

16 

13 

Table  4.1 

Data  on  Clinical  Change  by  Degree 
of  Infiltration  [Cochran  (1954.  p.435)] 


The  standard  tests  for  independence  applied  to  the  data  of  Table  4.1  yield 

X:  =  6.87  . 

G:  =  7.28  . 

each  with  4  d.f.  The  value  of  Cochran's  1  d.f.  test  for  regression  ts  6.67  (where  the  z's 
for  the  columns  are  -3.  -2,  -1.  0.  and  1). 


Next  we  examine  the  data  as  a  series  of  4  2x2  tables,  with  columns  cumulated  from  the  left 


as  illustrated  in  Table  4.2.  A  test  for  conditional  independence  of  degree  of  infiltration  and 
clinical  change,  given  "layer."  yields 

X:  =  6.64  . 

I 

G:  =  7.28  . 

I 

with  4  d.f.  Note  that  G:  takes  the  identical  value  as  that  for  the  likelihood  ratio  test  in  the 

I 

original  table  (numerical  confirmation  of  Rollback's  result).  The  test  for  no*2nd-order 
interaction  yields 

G;  =  1.61  . 

with  3  d.f..  and  thus  the  1  d.f.  component  of  Gj  which  tests  for  Stewart's  shift  alternative  in 
(4.13)  is 


AG:  =  7.28  -  1.61  =  5.67. 


This  value  is  quite  close  to  that  for  the  regression  1  d.f.  a:  test,  as  we  might  expect. 


Cochran  would  surely  have 

found  this  new  l 

d.f.  alternative  test 

quite  appealing,  especialh 

given  its  indirec'  relationship 

to  his  own  test 

for  combining  2x2 

tables,  discussed  in  the 

following  section. 

Degree  of 

Clinical  Change 

Layer 

Infiltration 

First  Proportion 

Second  Proportion 

0-7 

11 

27 

1. 

(column  1  vs. 

8-15 

7 

15 

column  2) 

0-7 

38 

42 

(columns  1  and  2 

8-15 

22 

16 

vs.  column  3) 

0-~ 

80 

53 

3. 

(columns  1.2.  and  3 

8-15 

38 

13 

vs.  column  4) 

0-7 

133 

11 

4. 

(columns  1.2.3.  and  4 
vs.  column  5) 


8-15 


51 

Table  4.2 


1 
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The  Combination  of  2x2  Tables 

In  one  of  the  final  sections  of  his  1954  paper.  Cochran  turns  to  the  problem  of  developing 
a  combined  test  of  significance  for  the  difference  in  proportions  as  displayed  in  a  scries  of 
2\2  tables. 

Suppose  that  for  the  / th  2x2  table  we  have  sample  sizes  n  and  n  .  observed  difference  in 
proportions  a  =  \  ,'n  -  \  ,/n  .  and  marginal  (combined!  proportion  p  =  (\  ♦>;.)'(n  -r.  ). 
Then  Cochran's  test  criterion  is  (in  its  1  d.f.  version) 


(Zw  d  ); 

Zw  p  ( 1  — p  > 


where 


n  n  . 

Vi  = 

n  -n 


(5.1) 


(5.2) 


We  now  know  that  C*  is  asymptotical!}  equivaleni  to  the  UMP  unbiased  tesi  for  conditional 
independence  (i.c.  independence  in  each  2x2  table)  versus  the  alternative  of  a  constant  odds- 
ratio  (i.e.  no-2nd-order  interaction)  across  2x2  tables  (Birch.  19o5>. 


To  illustrate  Cochran's  lest  we  applv  it  to  the  four  2x2  tables  in  Table  4.2: 

(10.5518); 

C-  = - =  4.85. 

22.9886 

Since  this  is  significant  at  the  .05  level,  we  are  able,  with  Cochran's  1  d.f.  test,  to  detect  the 
presence  of  the  shift  or  regression  alternative,  that  we  found  using  computational  more  complex 
methods  in  the  preceding  section. 


a 
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In  1959.  Mantel  and  Hacnszcl  independently  proposed  what  is  essentially  Cochran's  test,  with 
two  minor  modifications  (i)  a  "small  sample"  adjustment  in  the  weights  (w  }  in  the 
denominator  of  C:.  and  (ii)  the  use  of  a  continuity  correction  in  the  numerator.  There  has 
been  a  tendency  for  subsequent  authors  to  refer  to  the  Mantcl-Haenszel  test,  and  to  ignore  the 
fact  that  Cochran  derived  it  first.  It  is  a  measure  of  Cochran's  stature  and  temperament  that 
he  never  took  offense  at  this  oversight.  In  fact,  in  the  7th  edition  of  Snedecor  and  Cochran 
(1980).  as  in  the  preceding  edition,  he  noted  the  importance  of  the  two  refinements  in  the 
Mantcl-Haenszel  version  of  his  statistic. 

6.  The  Impact  of  Cochran's  Research  on  Categorical  Data  Problems 

It  would  be  a  mistake  to  assess  the  importance  of  Cochran  s  work  in  categorical  data  by 
simply  looking  at  the  new  results  he  derived.  In  a  sense,  it  is  the  impact  of  Cochran's  work 
(a)  on  the  research  of  others,  and  (b)  on  statistical  practice,  that  he  would  have  wanted  us  to 
examine. 

As  Colton  (1981)  has  suggested,  one  measure  of  Cochran's  impact  on  the  research  of  others 
is  the  frequency  with  which  some  of  his  important  contributions  arc  still  cued  in  the  literature. 
Table  6.1  gives  the  number  of  citations  listed  in  the  Science  Citation  Index  for  1965-1980  to 
those  three  of  Cochran's  on  categorical  data  analysis  which  were  the  focus  of  our  discussion  in 
Sections  2-5.  (Since  the  recommendations  on  small  expectations  in  Cochran  (1952.  1954) 
supercede  those  in  his  earlier  papers,  the  earlier  papers,  though  important,  are  no  longer  widely 
cited.)  The  remarkable  feature  discernablc  in  Table  6.1  is  that  Cochran's  articles.  25  to  30 
years  after  their  publication,  are  cued  in  the  statistical  literature  more  often  than  they  were  in 


the  1960  s. 
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Citation  Counts 

1965-1969  1970-1974  1975-1979:  1980 

Cochran  (1950)  29  48  59  15 

Paper  Cochran  (1952)  19  18  36  5 

Cochran  (1954)  67  159  196  34 

Table  6.1 

Citation  Counts  from  Science  Citation 
Index  for  1965-1980. 

Snedecor  and  Cochran  (1980)  remains  one  of  the  most  widely  cited  books  in  the  entire 
scientific  literature,  and  it  serves  as  a  methodological  guide  for  statisticians  and  nonstatisticians 
alike.  Throughout  its  discussion  of  categorical  data  analysis  one  can  find,  deftly  woven 
between  the  basic  methods  and  the  informative  examples  and  applications,  many  of  Cochran's 
own  recommendations  and  contributions.  Thus  we  can  expeci  Cochran's  work  to  continue  10 
influence  the  practice  of  scientists  dealing  with  categorical  data. 

About  a  year  prior  to  his  death,  in  the  winter  of  1979.  Bill  Cochran  visited  the  University 
of  Minnesota  to  take  pari  in  a  seminar  on  the  contributions  to  statistics  of  Sir  R.A.  Fisher. 
In  his  lecture.  Cochran  (1980)  made  reference  to  his  own  1940  paper  on  transformations  for 
Poisson  and  binomial  data,  and  the  exchange  he  had  with  Fisher  on  the  topic.  That  lecture 
was  impressive  for  its  modesty  and  the  unpretentious  manner  with  which  Cochran  put  his  own 
important  contributions  into  perspective.  His  1940  paper  was  clearly  correct,  and  provided  an 
innovative  approach  to  a  problem  Fisher  himself  had  posed.  But  when  Fisher  (1954)  reviewed 
the  various  approaches  to  the  analysis  of  non-normal  data  through  the  use  of  variance 
stabilizing  transformations,  he  implied  (incorrectly)  that  Cochran  supported  such  an  approach  in 
preference  to  the  maximum  likelihood  one.  Cochran  in  fact  had  shown  how  to  approximate 

the  ML  solution,  but  in  his  1979  lecture,  he  remarked  that 

In  retrospect  I  agree  that  Fisher's  approach  is  superior.  It  specifies  a  definite 
mathematical  model,  and  uses  maximum  likelihood  estimation,  recognized  as  preferable 


to  least  squares  estimation  with  non-normal  data.  My  results  were  that  analysis  of 
variance  on  the  transformed  scale,  with  or  without  variance-stabilizing  adjustments, 
agreed  closely  with  the  ML  estimates,  and  was  a  good  working  method,  particular!} 
when  extraneous  variation  is  present  so  that  the  assumptions  leading  to  Fisher's  ML 
solutions  do  not  apply. 

When  doing  statistical  research  on  a  topic  such  as  A-  tests  for  categorical  data  we  tend  to 
use  the  recent  literature  as  a  point  of  departure.  Rereading  Bill  Cochran's  papers  on  the 
analysis  of  categorical  data,  one  realizes  that  we  all  have  much  more  to  gain  by  going  back  to 
several  of  his  key  papers  on  the  topic,  and  reflecting  on  the  wisdom  contained  therein,  rather 
than  relying  on  someone  else's  summary  of  their  contents. 


•> 
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